Parameter-Free Hierarchical Co-clustering by n-Ary Splits

نویسندگان

  • Dino Ienco
  • Ruggero G. Pensa
  • Rosa Meo
چکیده

Clustering high-dimensional data is challenging. Classic metrics fail in identifying real similarities between objects. Moreover, the huge number of features makes the cluster interpretation hard. To tackle these problems, several co-clustering approaches have been proposed which try to compute a partition of objects and a partition of features simultaneously. Unfortunately, these approaches identify only a predefined number of flat co-clusters. Instead, it is useful if the clusters are arranged in a hierarchical fashion because the hierarchy provides insides on the clusters. In this paper we propose a novel hierarchical co-clustering, which builds two coupled hierarchies, one on the objects and one on features thus providing insights on both them. Our approach does not require a pre-specified number of clusters, and produces compact hierarchies because it makes n−ary splits, where n is automatically determined. We validate our approach on several high-dimensional datasets with state of the art competitors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

K-ary Clustering with Optimal Leaf Ordering for Gene Expression Data

MOTIVATION A major challenge in gene expression analysis is effective data organization and visualization. One of the most popular tools for this task is hierarchical clustering. Hierarchical clustering allows a user to view relationships in scales ranging from single genes to large sets of genes, while at the same time providing a global view of the expression data. However, hierarchical clust...

متن کامل

On Randomly Projected Hierarchical Clustering with Guarantees

1 Hierarchical clustering (HC) algorithms are generally limited to small data instances due to their runtime costs. Here we mitigate this shortcoming and explore fast HC algorithms based on random projections for single (SLC) and average (ALC) linkage clustering as well as for the minimum spanning tree problem (MST). We present a thorough adaptive analysis of our algorithms that improve prior w...

متن کامل

ABELIAN STATE-CLOSED SUBGROUPS OF AUTOMORPHISMS OF m-ARY TREES

The group Am of automophisms of a one-rooted m-ary tree admits a diagonal monomorphism which we denote by x. Let A be an abelian state-closed (or self-similar) subgroup of Am. We prove that the recurrence and tree-topological closure A∗ of A is additively a finitely presented Zm [[x]]module where Zm is the ring of m-adic integers. Moreover, if A∗ is torsion-free then it is a finitely generated ...

متن کامل

NEW TYPES OF FUZZY n-ARY SUBHYPERGROUPS OF AN n-ARY HYPERGROUP

In this paper, the new notions of ``belongingness ($in_{gamma}$)"and ``quasi-coincidence ($q_delta$)"  of a fuzzy point with a fuzzyset are  introduced. By means of this new idea, the  concept of$(alpha,beta)$-fuzzy $n$-ary subhypergroup of an $n$-aryhypergroup is given, where $alpha,betain{in_{gamma}, q_{delta},in_{gamma}wedge q_{delta}, ivq}$,  andit is shown that, in 16 kinds of $(alpha,beta...

متن کامل

Visualization, Search and Analysis of Hierarchical Translation Equivalence in Machine Translation Data

Translation equivalence constitutes the basis of all Machine Translation systems including the recent hierarchical and syntax-based systems. For hierarchical MT research it is important to have a tool that supports the qualitative and quantitative analysis of hierarchical translation equivalence relations extracted from word alignments in data. In this paper we present such a toolkit and exempl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009